Fast Multi-view Clustering via Ensembles: Towards Scalability, Superiority, and Simplicity

نویسندگان

چکیده

Despite significant progress, there remain three limitations to the previous multi-view clustering algorithms. First, they often suffer from high computational complexity, restricting their feasibility for large-scale datasets. Second, typically fuse information via one-stage fusion, neglecting possibilities in multi-stage fusions. Third, dataset-specific hyperparameter-tuning is frequently required, further undermining practicability. In light of this, we propose a fast xmlns:xlink="http://www.w3.org/1999/xlink">m ulti-v xmlns:xlink="http://www.w3.org/1999/xlink">i ew xmlns:xlink="http://www.w3.org/1999/xlink">c lustering xmlns:xlink="http://www.w3.org/1999/xlink">e nsembles (FastMICE) approach. Particularly, concept random view groups presented capture versatile view-wise relationships, through which hybrid early-late fusion strategy designed enable efficient With multiple views extended xmlns:xlink="http://www.w3.org/1999/xlink">many groups, levels diversity (w.r.t. features, anchors, and neighbors, respectively) are jointly leveraged constructing view-sharing bipartite graphs early-stage fusion. Then, set diversified base clusterings different obtained fast graph partitioning, formulated into unified final late-stage Notably, FastMICE has almost linear time space free tuning. Experiments on 22 datasets demonstrate its advantages scalability (for extremely large datasets), superiority (in performance), simplicity (to be applied) over state-of-the-art. Code available: https://github.com/huangdonghere/FastMICE .

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-objective Multi-view Spectral Clustering via Pareto Optimization

Traditionally, spectral clustering is limited to a single objective: finding the normalized min-cut of a single graph. However, many real-world datasets, such as scientific data (fMRI scans of different individuals), social data (different types of connections between people), web data (multi-type data), are generated from multiple heterogeneous sources. How to optimally combine knowledge from ...

متن کامل

Multi-View Clustering and Feature Learning via Structured Sparsity

Combining information from various data sources has become an important research topic in machine learning with many scientific applications. Most previous studies employ kernels or graphs to integrate different types of features, which routinely assume one weight for one type of features. However, for many problems, the importance of features in one source to an individual cluster of data can ...

متن کامل

Partial Multi-View Clustering

Real data are often with multiple modalities or coming from multiple channels, while multi-view clustering provides a natural formulation for generating clusters from such data. Previous studies assumed that each example appears in all views, or at least there is one view containing all examples. In real tasks, however, it is often the case that every view suffers from the missing of some data ...

متن کامل

Multi-View Clustering via Joint Nonnegative Matrix Factorization

Many real-world datasets are comprised of different representations or views which often provide information complementary to each other. To integrate information from multiple views in the unsupervised setting, multiview clustering algorithms have been developed to cluster multiple views simultaneously to derive a solution which uncovers the common latent structure shared by multiple views. In...

متن کامل

Multi-view Sparse Co-clustering via Proximal Alternating Linearized Minimization

When multiple views of data are available for a set of subjects, co-clustering aims to identify subject clusters that agree across the different views. We explore the problem of co-clustering when the underlying clusters exist in different subspaces of each view. We propose a proximal alternating linearized minimization algorithm that simultaneously decomposes multiple data matrices into sparse...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Knowledge and Data Engineering

سال: 2023

ISSN: ['1558-2191', '1041-4347', '2326-3865']

DOI: https://doi.org/10.1109/tkde.2023.3236698